Speech-in-noise enhancement using amplification and dynamic range compression controlled by the speech intelligibility index.

نویسندگان

  • Henning Schepker
  • Jan Rennies
  • Simon Doclo
چکیده

In many speech communication applications, such as public address systems, speech is degraded by additive noise, leading to reduced speech intelligibility. In this paper a pre-processing algorithm is proposed that is capable of increasing speech intelligibility under an equal-power constraint. The proposed AdaptDRC algorithm comprises two time- and frequency-dependent stages, i.e., an amplification stage and a dynamic range compression stage that are both dependent on the Speech Intelligibility Index (SII). Experiments using two objective measures, namely, the extended SII and the short-time objective intelligibility measure (STOI), and a formal listening test were conducted to compare the AdaptDRC algorithm with a modified version of a recently proposed algorithm in three different noise conditions (stationary car noise and speech-shaped noise and non-stationary cafeteria noise). While the objective measures indicate a similar performance for both algorithms, results from the formal listening test indicate that for the two stationary noises both algorithms lead to statistically significant improvements in speech intelligibility and for the non-stationary cafeteria noise only the proposed AdaptDRC algorithm leads to statistically significant improvements. A comparison of both objective measures and results from the listening test shows high correlations, although, in general, the performance of both algorithms is overestimated.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Real-time Speech-in-noise Intelligibility Enhancement Based on Spectral Shaping and Dynamic Range Compression

We demonstrate a real-time implementation of a speech-innoise intelligibility enhancement algorithm based on spectral shaping and dynamic range compression. The signal is enhanced before presented in a noisy environment, under the constraint of equal global signal power before and after modifications. The demonstrator modifies in real-time prerecorded sentences as well as input from a microphon...

متن کامل

Improving speech intelligibility in background noise by SII-dependent amplification and compression

In many speech communication applications it is of great interest to achieve a high intelligibility to ensure good communication. However, in these applications speech is often disturbed by additive noise and/or reverberation. Therefore, it is desirable to develop algorithms that are able to maintain a high intelligibility in such disturbed scenarios. While amplifying the speech to achieve good...

متن کامل

Effect of compressing the dynamic range of the power spectrum in modulation filtering based speech enhancement

In the modulation-filtering based speech enhancement method, noise suppression is achieved by bandpass filtering the temporal trajectories of the power spectrum. In the literature, some authors use the power spectrum directly for modulation filtering, while others use different compression functions for reducing the dynamic range of the power spectrum prior to its modulation filtering. This pap...

متن کامل

Can modified casual speech reach the intelligibility of clear speech?

Clear speech is a speaking style adopted by speakers in an attempt to maximize the clarity of their speech and is proven to be more intelligible than casual speech. This work focuses on modifying casual speech to sound as intelligible as clear speech. First, we examine the role of speaking rate for intelligibility. Clear and casual speech signals are time-scale stretched, matching the average d...

متن کامل

Speech Enhancement using Adaptive Data-Based Dictionary Learning

In this paper, a speech enhancement method based on sparse representation of data frames has been presented. Speech enhancement is one of the most applicable areas in different signal processing fields. The objective of a speech enhancement system is improvement of either intelligibility or quality of the speech signals. This process is carried out using the speech signal processing techniques ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • The Journal of the Acoustical Society of America

دوره 138 5  شماره 

صفحات  -

تاریخ انتشار 2015